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Abstract 

For planar landmark based shapes, taking into account the non-Euclidean 
geometry of the shape space, a statistical test for a common mean first 
geodesic principal component (GPC) is devised. It rests on one of two 
asymptotic scenarios, both of which are identical in a Euclidean geometry. 
For both scenarios, strong consistency and central limit theorems are es- 
tablished, along with an algorithm for the computation of a Ziezold mean 
geodesic. In application, this allows to verify the geodesic hypothesis for 
leaf growth of Canadian black poplars and to discriminate genetically dif- 
ferent trees by observations of leaf shape growth over brief time intervals. 
With a test based on Procrustes tangent space coordinates, not involving 
the shape space's curvature, neither can be achieved. 
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1 Introduction 

In this paper the novel statistical problem of developing asymptotics for the 
estimation of the mean geodesic on a shape space is considered. It is the gener- 
alization to a non-Euclidean geometry of the asymptotics for the estimation of a 
straight first principal component line from multivariate data in the Euclidean 
geometry. Due to curvature involved, however, methods from linear algebra as 
employed in the Euclidean geometry cannot be used, and a new approach has 
to be developed. The task at hand is more involved, yet somehow compara- 
ble to the situation of generalizing the concept of the mean for multivariate 
data to a mean for manifold valued data. For such manifold valued means pio- 
neering work for definitions, existence, uniqueness, algorithms and asymptotics 
has been done by Gower (1975); Ziezold (1977); Kendall (1990); Goodall (1991); 
Hcndriks and Landsman (1996, 1998); Le (2001); Bhattacharya and Patrangenaru 
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(2003, 2005) and many others. In this work definitions for a mean geodesic, an 
algorithm and asymptotics are proposed and developed for data on Kendall's 
space of planar shapes. In particular, the following two different statistical 
scenarios are considered: asymptotics with respect to underlying shapes - the 
mean geodesic of shapes - and asymptotics with respect to underlying sampled 
geodesies - the mean geodesic of geodesies. 

The study of geodesies on shape spaces as the simplest model for a path of 
temporal evolution of shape is of high interest in shape analysis, in particular, 
in biological studies comparing growth patterns. 

Unlike previous attempts in the literature (e.g. Jupp and Kent (1987); 
Kent et al. (2001); Kume et al. (2007) ) building on a Euclidean tangent space 
linearization of the shape space, the mean geodesic of geodesies defined here 
builds on a Euclidean tangent space linearization of the space of geodesies of 
the shape space which has been introduced in Huckemann and Hotz (2009). 
Hence as a new and quite abstract concept, we treat here geodesies as data 
points. 

In application, in a joint research study on leaf growth with the Institute 
for Forest Biometry and Informatics at the University of Gottingcn, it turns 
out that it is precisely this subtle difference of linearizing the space of geodesies 
and not the shape space that successfully allows to discriminate genetically 
different Canadian black poplars by observation of leaf shape growth during a 
short time interval of the growing period. The research study presented here 
is fundamental for model building of leaf shape growth as well as for designing 
effective subsequent studies to investigate multiple endogenous and exogenous 
factors in leaf shape growth: E.g. since the beginning of the last century it has 
been well known that the leaf shape of (genetically) identical trees varies along 
a climate gradient (e.g. Brenner (1902); Bailey and Sinnott (1915); Royer et al. 
(2009)). Since Wolfe (1978) this relationship has been successfully exploited for 
paleoclimate reconstruction resulting in the "Climate Leaf Analysis Multivariate 
Program" (CLAMP, Wolfe (1993)). Naturally, the underlying studies have been 
based on the shape of mature leaves; little is known about the temporal evolution 
of shape along a climate gradient. The research presented here indicates that a 
study involving only very few measurements of growing leaves may allow for a 
fairly good reconstruction and analysis of growth patterns, further elucidating 
the relationship of climate and leaf shape. 

This paper is organized in a theoretical and an applied part. 

The theoretical first part consisting of the following two sections establishes 
the statistical theory for the two types of means. In Section 2, after a brief review 
of Kendall's space of planar shapes, the concept of a Frcchet mean is extended 
to the space of geodesies while the underlying random deviates assume values 
in the shape space. Strong consistency in the sense of Ziezold (1977) a well 
as in the sense of Bhattacharya and Patrangenaru (2003) are established. In 
the appendix it shown that the original arguments can be extended nearly one- 
to-one to the general case considered here. In order to apply the central limit 
theorem (CLT) of Huckemann (2010b), smoothness in geodesies of the square 
of the canonical distance between shapes and geodesies for geodesies close to 
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the data is established. Then in Section 3, smoothness is shown for the square 
of a metric of Ziezold type (cf. Huckemann (2010b)) for the space of geodesies 
leading to the other CLT. Finally, after establishing an explicit method for 
optimal positioning a fast algorithm for the computation of a mean geodesic of 
geodesies is derived. An algorithm for the mean geodesic of shapes has been 
derived earlier (Huckemann and Hotz (2009)). 

The applied second part introduces the leaf shape data considered, the driv- 
ing questions from forest biometry, statistical tests and some answers through 
the data analysis. In Section 4 the problem of discrimination by short growth 
observations is discussed. In particular, the relevance of the geodesic hypothesis 
from Le and Kume (2000) is noted for the devising of statistical tests in Section 
5. These are evaluated in Section 6 showing that only the test for common 
geodesies can establish the validity of the geodesic hypothesis and the discrim- 
ination of genetically different trees on the basis of observations of brief leaf 
shape growth. Section 7 concludes with a discussion and gives an outlook. 

2 The First Geodesic Principal Component for 
Planar Shape Spaces 

Throughout this work E(Y) denotes the classical expectation of a random vari- 
able Y in a Euclidean space MP , D € N. A distance 5 on a topological space 
r is a continuous mapping i : T x T - > [0, oo) that vanishes on the diagonal 
{(lil) '■ 7 € r}; in contrast to a metric, 5 is neither required to be non-zero off 
the diagonal, to be symmetric nor to satisfy the triangle inequality. 

Kendall's planar shape spaces In the statistical analysis of similarity shapes 
based on landmark configurations, geometrical m-dimensional objects (usually 
m = 2,3) are studied by placing k > m landmarks at specific locations of 
each object, cf. Figure 1 on page 11. Each object is then described by a ma- 
trix in the space M(m, k) of m x k matrices, each of the k columns denoting 
an m-dimensional landmark vector. The usual inner product is denoted by 
(x,y) := tr(xy T ) giving the norm ||x|| = y (x, x) . For convenience and without 
loss of generality for the considerations below, only centered configurations are 
considered. Centering can be achieved by multiplying with a sub-Helmert ma- 
trix from the right, yielding a configuration in M(m, k — 1). For this and other 
centering methods cf. Dryden and Mardia (1998, Chapter 2). Excluding also 
all configurations with all landmarks coinciding gives the space of configurations 

Ft := M(m,k -1)\{0}. 

Since only the similarity shape is of concern, in particular we are not interested 
in size, we may assume that all configurations are contained in the pre-shape 
sphere := {x G M(m, k— 1) : ||x|| = 1}. Then, all normalized configurations 
that are related by a rotation from the special orthogonal group SO(m) form 
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the equivalence class of a shape 



[x] = {gx : g G SO(m)} 
and the canonical quotient is Kendall's shape space 

Sf n := S^JSO(m) = {[x] : x G S*,}, with canonical projection p : 5^ -» . 

In this paper we restrict ourselves to planar configurations, i.e. to the case of 
m = 2. Then, complex notation comes in handy. For a detailed discussion of the 
following setup, cf. Kendall (1984, 1989) as well as Kendall et al. (1999). We 
take the notation from Huckemann and Hotz (2009). Identify Fj" with C fc_1 \{0} 
such that every landmark column corresponds to a complex number. This means 
in particular that z G C^'" 1 is a complex row(!)-vector. With the Hcrmitian 
conjugate a* = (afcj) of a complex matrix a = (a,jk) the pre-shape sphere S% is 
identified with {z G C^'" 1 : zz* = 1} on which SO{2) identified with 5" 1 = {A G 
C : |A| = 1} acts by complex scalar multiplication. Then the well known Hopf- 
Fibration mapping to complex projective space gives £ 2 = S^/S 1 = CP k ~ 2 . 

The spaces of geodesies Note that every geodesic can be parametrized 
by unit speed, which we assume in the following. Every great circle j(t) = 
xcost + ijsini, x, v G iSijf , (x, v) = is a geodesic on S%, the space of geodesies is 
denoted by T(S>2 )■ A great circle is called a horizontal great circle if additionally 
(ix, v) = 0, the space of horizontal great circles is denoted by T H (S2 )■ It is well 
known (e.g. Kendall et al. (1999); Huckemann and Hotz (2009)) that this space 
projects to the space r(Sj) of geodesies of the shape space via 

r(^) = { P o 7 : 7 er ff (s 2 fc )}. 

Then, with the Stiefel manifold (giving all great circles) 

2 (2,fc- 1) = {(x,v) G F 2 fc x F| :{x,x)=l = (v,v), (x,v) = 0} 

every tuple in the implicitly defined submanifold (additionally requiring hori- 
zontality) 

Of (2, k - 1) := {(x, v) G O a (2, k - 1) : (x, iv) = 0} 

corresponds to an element in T H (Si;)- Several tuples, however, may determine 
the same geodesic. To this end consider the action of the orthogonal group 0(2) 
and S 1 from right and left, respectively, by (x, v) 1— > h t (x, v) g^^ with 

9*,*=( C0S t " eShl M e O(2), h t = e*eS\ 
\ sm</> ecoscp J w 

for cj), t G [0, 2ir) and e — ±1 defined by 

(x,v)g ( j, i e = (x cos <f> + v sin <j>, ve cos <j> — xe sin tfi) , 
h t (x,v) = (e zt x,e lt v) . 
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For a manifold M with a group K acting from the right and a group G acting 
from the left, denote by 

K\M/G = {[P],P G M}, where [P] = {gPk : g eG,keK} 

the canonical double quotient, (e.g. Terras (1988)) With this notation we take 
the following from Huckemann and Hotz (2009). 

Theorem 2.1. The space of point sets of all geodesies on planar shape space 
can be given the canonical structure 

r(E§) S 0(2)\Of (2,fc- l)/^ 1 

o/ a compact manifold of dimension 4fc — 10. 

In analogy to the naming of the pre-shape sphere cau (2, ft — 1) the 
space of pre- geodesies. 

A simpler argument yields r(5*2 ) as the compact manifold 

T(S 2 fe ) = 0(2)\O a (2,fc-l). (1) 

Distance from shapes to geodesies and between geodesies The spher- 
ical distance r(p, 7) = arccos W (p, x) 2 + (p,v) 2 of a point p to the geodesic 7 
defined by (x,v) G 0^(2, fc — 1) naturally defines a distance 

/>([p], P 7) = jiiir^ r(e l *p, 7) 

of the shape [p] to the geodesic p o 7 in the shape space. For a shape [p] G 
denote by 

r^ 4 = { 7 Gr^):p([p],7)< J} 

the open set of geodesies closer to [p] than 7r/4. The proof of the following 
Theorem 2.2 is deferred to Appendix B. 

Theorem 2.2. For fixed p G the function 

7 !-> P(H,7) 2 

is smooth on Tj^ 4 . 

In order to measure the distance between geodesies equip 0(2)\0| r (2, fc— 1) 
with a suitable Ricmannian structure - two of such structures are straight- 
forward, cf. Edelman et al. (1998), or more simply, embed 0|^(2,fc — 1) in a 
Euclidean space and consider the quotient distance w.r.t. to the corresponding 
extrinsic metric. More precisely, we generalize a setup introduced by Ziezold 
(1994) on the quotient £§. 
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Definition 2.3. The Euclidean distance 

d(P,Q) := V\\x-y\\ 2 + \\v-w\\ 2 

for P = (x,v),Q = (y,w) G 0^(2, k — 1) C i 7 ^ x defines the canonical 
quotient distance 

5(\PUQ]):= min d(gPh,g'Qh') . 
h,ti e 0(2), 

for [P][Q] G r(E|) called the Ziezold distance on T(E^). 

The mean geodesic of shapes In earlier work (Huckemann et al. (2010b)) 
establishing a general framework for geodesic principal component analysis, the 
mean geodesic of shapes has been called a first geodesic principal component. 

Definition 2.4. Suppose that X, Xi, . . . , X n are i.i.d. random pre-shapes map- 
ping from an abstract probability space (CI, A, V) to S% equipped with its Borel 
a-algebra. For lu G call the geodesies 7n (w),7* G r(E 2 ) the first sample and 
population geodesic principal component ( GPC) of the sample [Xi(w)], . . . , [X ra (w)] 
and [X], respectively, if 

n n 

Vp([X 3 (w)], 7 „(w)) 2 = min Vp([X 3 H], 7 ) 2 , /or allwefi, 

E(p({X}, 7 *f) = min E(p([X], 7 f) . 
7 er(£*) 

The random set of all sample GPCs is denoted by En P \uj), E^ P \[X]) is the set 
of all population GPCs. 

Theorem 2.5 (Asymptotics for the mean geodesic of shapes). For i.i.d. random 
pre-shapes X, X\, . . . , X n the set of first sample GPCs En (lu) is a uniformly 
strongly consistent estimator of the set of first population GPCs E^ ( [X] ) in 
the sense that for every e > and a.s. for every lu G there is a number 
n(e,Lu) G N such that 

oo 

|J E { f°\uj) C { 7 G r(S*) : d( 7 ,E^([X})) < e) . 

j=n 

Moreover, if E^\[X}) contains a unique element 7* contained in 

n itf 

pGSupp(X) 

with the support Supp(X) of X , if ~/ n G E^ (lu) is a measurable selection and 
x = 0(7) G M 4fc ~ 10 are local coordinates near 7* with 4>(l*) = Oj then 

A\fn ^>( 7 n) — > Af(0,Ti) in distribution 
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with the (4k — 10) -dimensional normal distribution 7V(0, S) with zero mean and 
covariance matrix E = COV (grad 2 ,p(X, 7*) 2 ) and A = E(i/ K p(X, 7*) 2 )) ■ Here, 
grad x and H x denote the gradient and Hessian of p 2 , respectively, w.r.t. the 
coordinate x. 

Proof. The assertion of strong consistency is a consequence of the general The- 
orem A. 3 and Theorem A. 4 in the appendix. Since p([X],^f) 2 is smooth in 7 as 
long as 7 is closer to [X] than 7r/4, by Theorem 2.2, the assertion of the central 
limit theorem (CLT) follows from the CLT of Huckemann (2010b, Theorem A.l) 
since ^{^2) i s compact. □ 

Remark 2.6. For practical applications of Theorem 2.5, in a given chart A and 
£ could be estimated by classical numerical and multivariate methods. Alterna- 
tively, estimates can be obtained simply from the data's covariance in a chart 
around a sample mean. In particular in case of non-singular A, that covariance 
tends asymptotically to A~ 1 Y,(A~~ 1 ) T . 

Uniqueness and location of the first GPC The hypothesis of a unique 
first GPC is essential for the following framework. Clearly, that hypothesis 
translates to an anisotropy condition on the random shape. E.g. on the basis 
of the geodesic hypothesis for biological growth as detailed in Section 4, we 
may assume uniqueness in the application in Section 6. The development of 
a test for specific anisotropy would certainly be of merit for other potential 
applications. By definition, every first GPC will be close to the support of [X]. 
Data analysis and numerical simulations show that the intrinsic mean is usually 
very close to the first GPC (e.g. Huckemann and Hotz (2009); Huckemann et al. 
(2010b)). Moreover, for sufficient concentration, the intrinsic mean is unique 
and contained in a ball around the support of radius ir/4 (cf. Kendall (1990); Le 
(2001)). Certainly, further research is necessary to tackle questions of uniqueness 
and location. 

3 The Ziezold Mean of a Random First GPC 

We are now in the situation of having samples of first sample GPCs and to 
determine their Frechet mean w.r.t. to some distance. In order to apply a CLT 
we are aiming for a mean in a smooth sense. It turns out that the comparatively 
simple Ziezold distance features the desired smoothness. 

Theorem 3.1. The following hold: 

(i) the action of S 1 and 0(2) is isometric with respect to d, i.e. d(P,Q) = 
d(gPh,gQh) for all P,Q E Of (2,fc - 1) and all g eS 1 ,^ 0(2), 

(ii) S 2 is smooth and 5 is a metric on r^Eji). 

Proof. Property (i) is easily verified, in fact, the left-action of S 1 and right- 
action of 0(2) are even isometric on the ambient C fe_1 xC fe_1 . Moreover on F% X 
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F* the isotropy groups are {(go,i, ho), (g n ,i, h„)}. For this reason in consequence 
of the Principal Orbit Theorem (e.g. Bredon (1972, Chapter IV. 3)), 6 extends to 
the natural geodesic quotient metric on the manifold 0(2)\(F% x F^)/ S 1 . Hence 
in particular, S 2 is smooth on the submanifold r(S|). As another consequence, 
since the extension of S is a metric, 5 itself is a metric which yields (ii). □ 

In view of the application in Section 6, we now consider samples of inde- 
pendent random geodesies obtained from not necessarily independent shapes as 
typically occur during observation of growth. In particular, the test for common 
geodesies devised in Section 5 relies on the following Theorem 3.3. 

Definition 3.2 (The mean geodesic of geodesies). Call 7* G r(Ef) 

a population Ziezold mean geodesic of a random geodesic H if 

E(5(S, 7 *) 2 )= min £(£(-, 7 ) 2 ), 
7 er(s£) 

a sample Ziezold mean geodesic of random geodesies Hi, . . . , S n if 

n n 

£<KS»,7*) 2 )= min Y>(3» l7 ) 2 ). 

The sets of population and sample Ziezold mean geodesies are denoted by E^(E) 
and En\ui), respectively. 

Theorem 3.3 (Asymptotics for the mean geodesic of geodesies). For i.i.d. 
random geodesies H, Hi, . . . , H„ the set of sample Ziezold mean geodesies En (w) 
is a uniformly strongly consistent estimator of the set of population Ziezold mean 
geodesies £"^^(H) in the sense that for every e > and a.s. for every u) G f2 
there is a number n(e,uS) G N such that 

00 

|J Ef\u) C {7 G ml) : «J( 7l 25«(5)) < e} . 

j=n 

IfE^ s \S) contains a unique element^/*, 7„ G E n S \uj) is a measurable selection 
and x — 0(7) G R 4 '" -10 are local coordinates near 7* with (j>(j*) = 0, then 

A\fn </>(7n) — > A/"(0,E) in distribution 

with the (4fc — 10) -dimensional normal distribution Af(0, E) with zero mean and 
covariance matrix E = COV (grad^^H, 7*) 2 ) and A = ~E(H X 8(E, 7*) 2 )) • Here, 
grad x and H x denote the gradient and Hessian of d 2 , respectively, w.r.t. the 
coordinate x. 

Proof. Since S is a metric by Theorem 3.1, the assertion on strong consistency 
is a consequence of Ziezold (1977) as Bhattacharya and Patrangenaru (2003, 



Remark 2.5) teach. Since S is neither an intrinsic nor an extrinsic metric, the 
CLT of Bhattacharya and Patrangenaru (2005) cannot be applied. Rather, the 
assertion of the CLT follows from the more general CLT of Huckemann (2010b, 
Theorem A.l), since by Theorem 3.1, 5 2 is smooth and r(E^) is compact. □ 

For practical applications of Theorem 3.3 proceed as detailed in Remark 2.6. 

For P,Q £ O2 (2, k — 1), g € S 1 and h £ 0(2) call gQh is in optimal position 
to P, if d(P,gQh) = 6([P], [Q])- Since both groups 0(2) and S 1 are compact, 
given P £ 02(2, k — 1), every Q £ 6)2(2, — 1), can be placed into optimal 
position to P. Moreover, if [P*] is the unique Ziczold mean geodesic of sampled 
geodesies [Pi], . . . , [P„] then P* is the extrinsic mean of the gjPjhj placed into 
optimal position to P* , gj £ S 1 , hj £ 0(2), j = 1, . . . ,n, i.e. 

n 

P* = argmin Pe02(2 fc _ 1) ^ min , d ( p ,9jPjhj) , 



hj g 0(2), 



cf. Huckemann (2010c). The extrinsic mean then is the orthogonal projection to 
O2 {2, k — 1) of the classical Euclidean mean in ambient M (2, k — 1) x M(2, k— 1), 
cf. Hendriks and Landsman (1998); Bhattacharya and Patrangenaru (2003). 

In the first step we solve the problem of optimally positioning analytically, 
in the second step we compute the orthogonal projection. Based on the two, the 
algorithm of Ziczold (1994) is adapted, to compute the Ziezold mean geodesic. 

Theorem 3.4. Let P = (x, v),Q = (y.w) £ 0^(2, k - 1) and define 

A := (x,y) + e(v,w) , B := (x, w) - e(v, y) , 
C := (x,iy) + t{v,iw) , D := (x,iw) — e(v,iy) . 

Then, for g^^, h t putting Q into optimal position gtQh^^ to P , it is necessary 
that 

B + Dtant 

tan = — — 

A + Ct&nt 

and that t satisfies 
fi) 

r-, , C 2 + D 2 - A 2 - B 2 

tant = a ± y a z + 1, with a = 



2{AC + BD) 
in case of AC + BD ^ 0, 

(ii) t = in case of AC + BD = =/= A 2 + B 2 - C 2 + D 2 . 

-it/2 < t < ir/2 may be arbitrary in case ofAC + BD = = A 2 + B 2 - C 2 + D 2 

Proof. 

d{P,g t Qh^) 2 



i(f>({x, e lt y) + e(v, e lt w)^j + sin0^(x, e lt w) - e{v, e lt y)^j 



gives 



A-d{P,g t Qh^) 2 
2 

= A cos cj> cos t + B sin <f> cos t + C cos cj> sin t + D sin sin t . 

For fixed 0, a necessary condition for t = t(<f>) to maximize the above r.h.s. is 
that 

C cos 4> + D sin <fi C + D tan 6 

tan t = — = — . 

A cos <p + B sin <fi A + B tan q> 

Similarly, a necessary condition for <f> — <p(t) is that 

B cos t + D sin t B + D tan t 

tan < 



A cos i + C sin t A + Ct&nt 
Letting £ = tan £,77 = tan0 we obtain 

C + D V _ C(A + CQ + D(B + DC) 
C A + Bn A{A + CQ + B{B + DQ 

and, equivalently 

(AC + BD)( 2 + (A 2 + B 2 )( = (C 2 + D 2 )(+ (AC + BD), 
yielding the assertion. 



□ 



Theorem 3.5. Suppose that P = (x, v) € F% x F 2 fe , then (C, rf) g Of (2, fc - 1) 
is i/ie orthogonal projection of P to Of (2, fc — 1) if and only if 



T] = -^(v-(v,0(-(v,iOi() 



£ is arbitrary in case of (x,Q = 0, and r\ is arbitrary in case of (v,rj) = 0. 

Proof. Apply Lagrange minimization to \\x— £|| 2 + \\ v ~ v\\ 2 f° r Cj V € ^2 under 
the constraining condition $(£,77) = for 



I l-(x,x) \ 
1 - (v,v) 
2(x,v) 
\ 2 (a:,™) / 



□ 
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Algorithm to obtain a pre-geodesic of a Ziezold mean geodesic Let 

Pi , . . . , Pj be a sample of pre-geodesics. Starting with an initial value (a^ ) , i/ ' ) = 
p(0) := p lj say , obtain p(™+ x ) = (x ( - n+1 \ from PM = (x^ ,«(">) for 

n = 0, 1, ... by putting all Pj (j = 1, . . . , J) in optimal position P* = (y*,w*) 
to p(") by computing the corresponding <f)j,tj,€j from Theorem 3.4. Then, set 

(x,v) : = j fx^.xxj 

and let p(™ +:L ) be the orthogonal projection of (x,v) to 0^(2, fc — 1) from 
Theorem 3.5. 



4 Leaf Growth Data and Problem Statement 



Figure 1: Top row: typical leaf 
y>' \ growth over a growing period of 
/ \ a reference tree (left) and one of 
the two clones (right). Bottom 
S left: typical digitized leaf contour 
X and landmarks of the corresponding 
quadrangular configuration at peti- 
X ole, tip, and largest extensions or- 
thogonal to the connecting line. 



From leaf data to shape descriptors We consider leaf shape data collected 
from two clones and a reference tree of black Canadian poplars at an experi- 
mental site at the University of Gottingen. These data are similar but different 
from the data reported on in Huckemann et al. (2010a) and Huckemann (2010a). 
They consist of the shapes of 21 leaves from clone 1 and of 11 leaves from clone 
2 as well as of the shapes of 12 leaves from the reference tree, all of which 
have been recorded non-destructively over several days during a major portion 
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of their growing period of approximately one month (the maximal number of 
observations is 17, the minimal 2 with a median of 13). The top row in Figure 
1 shows typical contours of leaf growth. At each time point, from each leaf 
contour a quadrangular landmark based configuration has been extracted by 
placing one landmark at the petiole (where the stalk enters the leaf blade) , one 
at the leaf tip (the cndpoint of the main leaf vein) and two, each at the maxi- 
mal extensions orthogonal to the line connecting petiole and tip, cf. the bottom 
image of Figure 1. These four landmarks encode in particular the information 
of length, width and vertical and horizontal assymetry. As detailed in Section 
2, these landmarks additionally convey a correspondence of leaves with shapes, 
i.e. points in the shape space £|. This space is a non-Euclidean manifold and a 
special case of Kendall's landmark based shape spaces. 

The geodesic and parallel hypotheses for biological growth Investigat- 
ing landmark based configurations of rat skulls, Le and Kume (2000) observed 
that: 

the shape change due to biological growth mainly 
follows a geodesic in Kendall's shape space. 

In a research modeling the growth of tree-stem disks as well as leaf growth this 
geodesic hypothesis has been corroborated by Hotz et al. (2010). Jupp and Kent 
(1987); Kent ct al. (2001); Evans et al. (2006); Kume et al. (2007) have pro- 
posed more subtle models for shape growth essentially building on polynomials 
in Procrustes residuals (cf. Section 5 below). 

Additionally, Morris et al. (2000) observed parallel growth patterns and coined 
the parallel hypothesis, stating that Procrustes residuals of related biological ob- 
jects follow curves parallel in the Euclidean geometry of the tangent space at a 
Procrustes mean. In view of the geodesic hypothesis we restrict those curves to 
straight lines, generally however, not mapping to geodesies (cf. Figure 2). 

A brief discussion of the geodesic hypothesis In DArcy Thompson's 
seminal work Thompson (1917), biological form and growth of form has been 
explained by the invocation of the mathematical concpet of force. More recently, 
the relationship between growth and energy minimization has been explored by 
Bookstein (1978). These works have led Le and Kume (2000) to the above 
hypothesis, being aware that, firstly, geodesies depend on a specific geometry 
of a shape space, and secondly, even though many paths of growth seem to 
follow geodesies, there are examples where the geodesic fit is rather poor (e.g. 
Evans et al. (2006)). E.g. for the space of planar triangles, the hyperbolic 
geometry of the complex upper half plane introduced by Bookstein (1986) seems 
just as natural as the spherical geometry of the complex projective space in one 
complex dimension (Kendall's shape space for planar triangles, cf. Kendall 
(1984)). However, considering geodesies in Kendall's planar shape space as a 
rough working hypothesis, in particular for the leaf shapes in question, seems 
like a promising starting point for statistical investigation in the same way that 
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Figure 2: Tangent space of the 
two-dimensional £2 ( obtained 
from E| by leaving out one land- 
mark) under the inverse Rie- 
mann exponential at the in- 
trinsic mean corresponding to 
the extent of the overall data 
( clones 1+2 and reference 
tree). The left top vector is the 
affine parallel transport of the 
bottom vector in the Euclidean 
tangent space, the right top vec- 
tor is its intrinsic parallel trans- 
plant along a geodesic (dashed). 



approximate linearity has served statisticians well since the time of Gauss (or 
even earlier). 

Problem statement As visible in Figure 1 , the shape of leaves of the clones 
can usually be well discriminated from the shape of leaves of the reference 
tree by visual inspection. Following the geodesic hypothesis, the shape change 
under growth could be predicted from initial observations, ideally two initial 
observations would suffice. Since for the data at hand, the evolution of leaf 
contours have been followed elaborately along several time points, then the 
effort for future research could be cut down considerably. This leads to the 
following fundamental problem. 

Can leaf growth of genetically identical trees be predicted 
and discriminated from growth of genetically different trees 
on the basis of few initial measurements? 

In this research we restrict ourselves to measuring shape by four landmarks as 
detailed above. 

5 Tests for Shape Dynamics 

The precise definitions used in this section can be found in Sections 2 and 3. 
For every leaf considered a first geodesic principal component (GPC - a gen- 
eralization to manifolds of a first principal component direction) is computed 
either from its first two shapes (then the GPC is just the geodesic connect- 
ing the two, the correspondig group is called "young") or from the rest of the 
shapes (that goup is called "old") over the growing period (for an algorithm, 
see Huckemann and Hotz (2009)). For two groups (young vs. young, young vs. 
old and old vs. old of different trees) to be tested for a common mean first 
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GPC, the procedure detailed below produces data in a Euclidean space, namely 
in the tangent space of the space of geodesies at a mean geodesic. For compar- 
ison, a classical test for equality of mean shape as well as a test for equality of 
mean direction based on classical methodology described below, similarly pro- 
duce data in a Euclidean space, namely the tangent space of the shape space at 
a mean shape. For all three procedures, within the respective Euclidean space, 
the corresponding tests then test for a common mean via the classical Hotclling 
T 2 -test. Note that no two groups from the same tree are tested because of 
statistical dependence. 



sample sizes 20,20 sample sizes 10,10 

factor of covariance matrices: 2 factor of covariance matrices: 2 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 



p-values p-values 

sample sizes 20,10 sample sizes 20,10 

factor of covariance matrices: 2 factor of covariance matrices: 1 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 



p-values p-values 

Figure 3: Hotelling's T 2 '-test under nonnormality. The cumulative distributions 
of the empirical test statistics have been generated with 1,000 repetitions of two 
groups of sizes as displayed in the respective headers, with three-dimensional 
deviates uniform in a 3D interval centered at the origin, whose covariance ma- 
trices (which are determined by the dimensions and rotation of the interval) 
are constant multiples of one another up to a random rotation (fixed for every 
display). The respective headers give the corresponding constant factors. 

Robustness under nonnormality Recall that the Hotelling ^-distribution 
is a generalization of a Student T-distribution. As in the univariate case, the 
corresponding test statistic follows this distribution, if the coordinate data fol- 
low a multivariate normal distribution. Since the shape spaces considered are 
compact, obviously, we may never assume this central hypothesis. It is well 
known, however, that the corresponding statistic is robust to some extent under 
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nonnormality, one condition being finite higher order moments. Clearly, this 
condition is met on a compact space. Even more, it is known that asymptoti- 
cally the distribution of the corresponding statistic is unchanged under unequal 
change of covariances if the ratio of sample sizes tends to 1, e.g. Lehmann (1997, 
p. 462). The simulation in Figure 3 illustrates robustness for fairly small sam- 
ple sizes: under nonnormality and unequal sample sizes (bottom right display), 
and under nonnormality and unequal covariances with equal sample sizes (top 
row). The test is not robust under non- normality, however, if sample sizes and 
covariances differ considerably (bottom left display). 

The test for common geodesies Every first GPC computed above deter- 
mines a unique element in the space of geodesies r(S|) of Using the embed- 
ding of the space of pre-geodesics (2, 3) into C 3 x C 3 as detailed in Section 
3, these elements are orthogonally projected to the tangent space of the two 
group's Ziezold mean (an element of the space of geodesies r(S|)) thus giving 
data in a Euclidean space. The corresponding null hypothesis is then 

the temporal evolution of shape for every group follows a common geodesic. 

In other words, if ji are the first GPCs of leaf growth of groups i = 1, 2 to be 
specified later, then the null hypothesis states Hq : 71 = 72- This is a hypothesis 
on the mean geodesic of geodesies which can be tested by use of Theorem 3.3. 

Tests for common means Following the classical scheme (e.g. Dryden and Mardia 
(1998, Chapter 7)), all shapes of the two groups considered are projected to the 
tangent space of their overall Procrustes mean giving Procrustes residuals with 
the null hypothesis, 

the Procrustes tangent space coordinates of the temporal evolution 
of shape for every leaf have the same Euclidean mean. 

A theoretical note on related tests Taking instead the Procrustes resid- 
uals at the common Procrustes mean of the Procrustes means of every tem- 
poral evolution of shape would give a different test. If all leaves considered 
have equally many individual shapes then this second test is very closely ap- 
proximated by the first test. Otherwise, choosing suitable weights will give 
a close approximation. The goodness of the approximation can be numeri- 
cally confirmed, it also follows from the fact that only one effect is tested, cf. 
Huckemann et al. (2010b, pp. 2-4). Similar tests for static shape utilize intrin- 
sic or Ziezold means, respectively. In fact, for hypotheses on non-degencrate 
three-dimensional shapes one could only test hypotheses building on intrinsic 
or Ziezold means (because Procrustes means may lie outside the manifold part 
such that the CLT is not applicable, cf. Huckemann (2010c)). 

Tests for common directions Returning to the classical scheme, following 
Morris et al. (2000), compute the Euclidean first principal component of each 
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set of Procrustes residuals corresponding to the shapes of a single leaf's evolu- 
tion pointing into the direction of growth. For the analysis of the unit length 
directions we proceed as described above. Within the Procrustes paradigm, the 
residual tangent space coordinates of these directions at their common residual 
mean closer to the data are projected orthogonally to the Euclidean space of 
suitable dimension. The null hypothesis is then, 

the Procrustes tangent space coordinates of the temporal evolution 
of shape for every leaf share the same first Euclidean PC. 

This is the version of the parallel hypothesis for this paper. If the static mean 
shapes of the two groups considered arc different, then the common Procrustes 
mean depends on the ratio of the two sample sizes. Moreover, even for common 
static shape, the null hypothesis incorporates effects of curvature, cf. Figure 2. 

6 Discriminating Canadian Black Poplars by Par- 
tial Observation of Leaf Growth 

For the following tests, as the first groups called "young" in the following, the 
first two initial shapes from every leaf considered have been taken and the unique 
geodesic joining the two has been computed. For the second groups, called "old" 
in the following, for every leaf considered the first GPC of the rest of the shapes 
has been computed, if the number of the rest of shapes exceeded 3. For this 
reason, the number of GPCs in some of these groups is possibly smaller then 
the number in corresponding groups of "young" . To the respective groups the 
three tests introduced in Section 5 have been applied. The results are reported 
in Table 1 in different order, however, since the tests for common geodesies and 
for common directions test for closely related concepts. 

As visible in Table 1, the test for common geodesies (first data column) 
allows to discriminate very well genetically different trees from genetically iden- 
tical trees by the observation of growth over restricted time intervals only. Due 
to curvature (cf. Figure 2) in the first box for genetically identical trees, when- 
ever over these restricted time intervals the means differ (ultimate data column, 
i.e. when the growth of young leaves is compared to the growth of old leaves), 
then the directions also differ highly significantly (middle data column). For 
genetically different trees (third box) all of the group means differ highly sig- 
nificantly (ultimate data column) and most of the directions as well. Note that 
genetically different young leaves cannot be discriminated by their directions. 
This can be explained by comparison with the bottom right display of Figure 
4: leaf shapes of young leaves tend to be comparatively close to each other. 

Figure 4 depicts the first two dominating coordinates (explaining between 
80 % and 90 % of the total variation) of a tangent space projection of the overall 
dataset (young and old) for clones and the reference tree. In contrast to the 
test for common geodesies (cf. top display), almost all of the different groups 
(young vs. old of clone 1, clone 2, and the reference tree) can be discriminated 
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by the directional data (bottom left display, e.g. the red and black crosses 
tend to lie on the l.h.s., the red and black circles on the r.h.s.). In all of the 
first two displays (top and bottom left), for the clones, the variance of the 
"young" data appears slightly larger than the variance of the "old" data. For 
the reference tree, however, this is not the case. This effect is not visible when 
observing means only (bottom right display). As discussed in Section 5, unequal 
covariances may be troublesome w.r.t. to the validity of the Hotclling T 2 -test 
employed if sample sizes are not approximately equal. Considering in Table 1 
only the samples of similar sizes 9, 11 and 12, however, comparable classification 
results are obtained. 

Conclusion From this study we conclude the following. 

(a) Clone and reference tree can be discriminated by partial observations of 
leaf shape growth not necessarily covering the same interval of the growing 
period via the test for common geodesies. This is not possible via a test for 
common means (due to temporal change of shape) or common directions 
(due to curvature). 



dataset 1 


dataset 2 


geodesies 


directions 


means 


clone 1 young (21) 


clone 2 old (11) 


0.75 


1.6e - 04 


4.7e - 08 


clone 1 young (21) 


clone 2 young (11) 


0.97 


0.82 


0.71 


clone 2 young (11) 


clone 1 old (20) 


0.066 


0.0021 


5.5e - 07 


clone 1 old (20) 


clone 2 old (11) 


0.17 


0.21 


0.71 


correct classification of clones: 








at 95 %-level 


100.00 % 


50.00 % 


50.00% 


at 99 %-level 


100.00 % 


50.00 % 


50.00 % 


clone 1 young (21) 


reference young (12) 


0.0012 


0.65 


6.5e - 06 


clone 2 young (11) 


reference young (12) 


0.0043 


0.79 


0.0015 


clone 1 young (21) 


reference old (9) 


0.0067 


0.0077 


1.9e - 07 


clone 2 young (11) 


reference old (9) 


0.026 


0.023 


1.0e-05 


clone 1 old (21) 


reference young (12) 


0.00022 


0.0013 


2.4e-06 


clone 2 old (11) 


reference young (12) 


0.0092 


0.014 


0.0046 


clone 1 old (21) 


reference old (9) 


0.087 


0.023 


0.0014 


clone 2 old (11) 


reference old (9) 


0.021 


0.018 


0.0068 


correct classification of the reference tree 








at 95 %-level 


87.50% 


75.00% 


100.00 % 


at 99 %-level 


62.50% 


25.00% 


100.00 % 


correct classification: 








clones vs. 


reference tree 








at 95 %-level 


93.75% 


62.50% 


75.00% 


at 99 %-level 


81.25% 


37.50 % 


75.00 % 



Table 1: Displaying p-values for several tests for the discrimination of clones 
from the reference tree via leaf growth ("young" denotes the dataset comprising 
the first initial two observations and "old" the dataset comprising the rest of 
the observations) . For convenience, the sample size (number of different leaves 
followed over their growing period) of the corresponding data set is reported in 
parentheses. 
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Geodesic 




Figure 4: Projection of leaf shape growth from two clones (red for clone 1 and 
black for clone 2) and a reference tree (green) to the dominating coordinates 
of the corresponding tangent space at the overall mean as detailed in Section 5. 
Each cross represents a single leaf's initial shape evolution over two observations 
(young), each circle represents the rest of the leaf's shape evolution during its 
observed growing period (old). Top: GPCs projected to the tangent space at 
the Ziezold mean geodesic. Bottom row: all shapes have been projected to the 
tangent space of the overall Procrustes mean. Bottom left: unit directions of 
first Euclidean PCs. Bottom right: Euclidean means. 



(b) The "geodesic hypothesis" has been validated by use of the test for com- 
mon geodesies, notably it could not have been validated using the test for 
common directions. 

(c) For a statistical prediction of future leaf shape growth, two initial obser- 
vations suffice. 

Application Let us elaborate on one consequence of conclusion (a). Suppose 
that we have several leaf shape growth data of clones and the reference tree of 
short but arbitrary time intervals (young, old, intermediate, etc.). Then most 
likely the test for common means will not be able to identify the clone from 
the reference tree, because the mean shapes of the different time intervals will 
most likely be different even for the same leaf considered. Similarly, due to 
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curvature the test for common directions will fail, unless all data are jointly 
highly concentrated. It is only the test for common geodesies that may furnish 
the desired discrimination. 

7 Discussion 

The geodesic hypothesis of Le and Kume (2000) - its scope and limitations have 
been discussed in Section 4 - has been corroborated in many scenarios of biolog- 
ical growth. In this paper a test for common geodesies for this hypothesis has 
been devised and successfully applied to the problem of discriminating poplar 
leaf growth based on two initial observations only. For computational feasibility, 
not the concept of an intrinsic mean geodesic but rather that of a Ziezold mean 
geodesic has been employed. One can thus call the test devised a semi-intrinsic 
test. The semi-intrinsic test for comman geodesies has been compared to a test 
for common directions building on directions in the space of Procrustes residuals 
(following Morris et al. (2000)). This is essentially a non-intrinsic test because 
it linearizes the shape space and not the space of shape descriptors tested for. It 
turned out that for the discrimination task at hand, curvature present rendered 
this test ineffective. The author is not aware of any other test for the geodesic 
hypothesis in the literature. 

In this work we have considered two types of mean first GPCs, one defined 
by a sample of GPCs with underlying samples of random shapes, which - like 
growth patterns - are obviously dependent. For independent sampling the other 
mean first GPC has been defined directly by the shape data. Since p (defining 
the latter) is different from 8 (defining the former, cf. Section 2) - as a mani- 
festation of 'inconsistency' (cf. Kent and Mardia (1997); Huckemann (2010b)) 
- the limit of mean random geodesic of geodesies and the population geodesic 
of shapes may be different as well. Studying their relationship, however, may 
provide further insight. 

In conclusion let us ponder on extensions and generalizations of this re- 
search. One may view all shape descriptors as generalized Frechet means on 
suitable spaces. For geodesies on Kendall's planar shape spaces, we have pro- 
vided an explicit framework using a Ziezold mean geodesies which can be com- 
puted fairly easy. Straightforward but considerably more complicated is the use 
of intrinsic mean geodesies. At this point we note that the space of generalized 
geodesies on Kendall' shape space for dimension m > 3 ceases to be a manifold. 
Like a shape space, it can be viewed as the quotient of a Riemannian mani- 
fold modulo a Lie group action. In contrast to Kendall's shape spaces, the top 
space itself (a submanifold of a Grassmannian) admits two canonical geometries 
(cf. Edelman et al. (1998)). For the embedding underlying the Ziezold mean 
geodesic, we have used the simpler of the two. Possibly, this framework extends 
to other one-dimensional shape descriptors, such as arbitrary circles on spheres 
(cf. Jung et al. (2010a)), or more generally, the family of constant curvature 
curves, as well as to higher dimensional shape descriptors (for geodesic descrip- 
tors cf. Huckemann et al. (2010b), for non-geodesic descriptors cf. Jung et al. 
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(2010b)). Thus, (semi)-intrinsic inference on any of such descriptors may be 
possible. 

One final word of caution: even for fairly simple spaces such as the torus 
or the surface of an infinite cylinder, the canonical topology of the space of 
geodesies is non-Hausdorff (cf. Beem and Parker (1991)) and thus may not 
admit any meaningful statistical descriptors. 
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A Strong Consistency 

For this section suppose that X, X\,X<2, ■ ■ ■ are i.i.d. random elements mapping 
from an abstract probability space (Q,A, V) to a topological space Q equipped 
with its Borel <7-field; (P, d) denotes a topological space with distance d. 

Definition A.l. For a continuous function p : Q x P — > [0, oo) define the set 
of population Frechet p-means of X in P by 

EV)(X) = argmh VeP E(p(A, M ) 2 ) . 

For uj G £1 denote by 

n 

E n p \u) = argmin M6 p ^ p{X j (w),n) 

3 = 1 

the set of sample Frechet p- means. 

By continuity of p, the mean sets are closed random sets. For our purpose 
here, we rely on the definition of random closed sets as introduced and studied 
by Choquet (1954), Kendall (1974) and Matheron (1975). Since their original 
definition for P = Q, p = d a metric by Frechet (1948) such means have found 
much interest. 

We will work with the following two definitions of strong consistency, each 
has been coined as such for metrical Frechet means by the respective authors. 

Definition A. 2. Let En (u>) be a random closed set and a deterministic 
closed set in a space with distance, (P,d). We then say that 
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(ZC) En\uj) is a strongly consistent estimator in the sense of Ziezold (1977) 
of E^ if for almost all uj £ ft 



n— 1 k—n 



(BPC) En (uj) is a strongly consistent estimator in the sense of Bhattacharya and Patrangenaru 
(2003) of E^ if £W ^ and if for every e > and almost all uj £ ft 
there is a number n = n(e,u)) > such that 



\jE { k P \uj)ci{p£P:d(E^,p)<e}. 

k 



For quasi-metrical means on separable (i.e. containing a dense countable 
subset) quasi-metrical spaces, Ziezold (1977) proved (ZC). For metrical means 
on spaces that enjoy the stronger Heine-Borel property (i.e. that every bounded 
closed set is compact), Bhattacharya and Patrangenaru (2003) proved (BPC) . 
One may argue from a statistical point of view that for "consistency" one would 
want to have equality of points, and if not possible, at least an equality of sets 
(the set of 3D Procrustes means always contains at least two antipodal points, 
e.g. Huckcmann (2010b)), rather than an inclusion only. Even though it would 
be interesting to construct an example with strict inclusion, it seems that this 
case has no relevance in applications. 

In order to generalize Ziezold's and the Bhattacharya-Patrangenaru Strong 
Consistency Theorem we introduce two properties. A continuity property in 
the second argument uniform over the first argument - a consequence of the 
triangle inequality if p is a quasi-metric - and a version of coercivity in the 
second argument - again valid if p is a quasi-metric: 



for every x £ Q,p £ P and e > there is a 8 = S(e,p) > 
such that \p(x,p') — p(x,p)\ < e for all p' £ P with d(p,p') < S 

there are po £ P and C > such that V{p(X,p ) < C} > and 
that such that for every sequence p n £ P with d(po,p n ) — > oo 
there is a sequence M n — > oo with p(x,p n ) > M n for all x £ Q 
with p(x,po) < C. Moreover, if p n £ P with d(p*,p n ) — > oo 
for some p* £ M, then d(po,p n ) — > oo. 



(2) 
>(3) 



Theorem A. 3 (Ziezold's Strong Consistency). Let p : Q x P — s- [0,oo) be a 
continuous function on the product of a topological space with a separable space 
with distance (P,d). Then strong consistency holds in the Ziezold sense (ZC) 
for the set of Frechet p-means on P if 

(i) X has compact support, or if 

(ii) E(p(A,j>) 2 ) < oo for allp £ P and p is uniformly continuous in the second 
argument in the sense of (2). 
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Proof. Obviously under (i) for every p £ P and e > we may assume that there 
is S = 5(e 7 p) such that \p(X,p') — p(X,p)\ < e a.s. if d(p,p') < S. With this is 
mind, it suffices to prove the assertion under (ii). 
For uj G Q, p G P set 

Fn(p) = H:Up{XMiP)\ F(Q) = HP(X,P) 2 ), ) 

£ n = m{ peP F n (q), t = M pGP F(p), M4) 

E n = { P eP:F n (p)=£}, E = { P eP:F(p)=£}. J 

We now follow the steps laid out in Ziczold (1977). Let pi,p2, ■ ■ ■ be dense in 
P. From the usual Strong Law of Large Numbers in K we have sets A k C Q, 
V(Ak) = 1 such that F n (pk) n —?° F(p k ) for every k = 1,2,... and uj G Afc. 
Setting A := flj^ Afc we have hence for all p = pk, k = 1, 2, . . . 

F n {p) ™ f (p) for all uj e A, T(A) = 1. (5) 
Next, let p,p' G -P. Setting f(q,p',p) '■= p(q,p') — p(q,p) we have then 

n 

|F„(p')-F n (p)| < -Y,(p{Xj,l/) + P(Xi,P))\p(Xj,l/)-p(Xi,P)\ 

r sE-=i(2pft-,p')+/(^ 1 p 1 j''))i/ft- 1 p,P')i 

I iE;=i(2p(^,p) + /(^,p',p))|/(X j ,p,y)l 

W.l.o.g. we may suppose that — > p G P. In consequence from the top line of 

(6) for p' = p k , : 

£E?=iP(*iM,p fc ) 2 - £E?=i (2p(^-,») + |/(x i ,p,p fc )|)|/(x i ,p, Pfc )| 

<^Ei=iP(^P) 2 )< 
sE?=ip(^(«).w) j + £E?=i (2p(x,-,p fc ) + l/(x,-,j>,p fc )l)l/(^,i>,Pfc)l 

Taking the expected value, i.e. employing (5) at pu, by hypothesis (i) or (ii) as 
explained in the beginning of the proof, for arbitrary e > we may assume that 
there is a S > such that for all d(pk ,p) < 6 

E(p(X, Pk ) 2 ) - (m(p{X,p k )) +e)e 
< Uminfn^oo i E"=i p(Xj,pf < limsup„^ oc ± E"=i P( x j,P) 2 < 
E(p(X,pk) 2 ) + (2E(p(X,p k ))+ey. 

Letting e — > we can choose a subsequence p ki — > p, hence, this yields the 
validity of (5) for all p G P. 
Let us now extend (5) to 

F n {p n ) n ^-°° F(p) for all sequences p n — > p and uj g A, V(A) = 1 . (7) 
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Utilizing the bottom line from (6) yields 



1 - 

F n ( Pn ) - F n (p)\ < -^2(2p(X j ,p) + \f(X jt p n ,p)\)\f(X jt p,p n )\ 



0. 



n 



Hence as desired \F n (p n ) - F(p)\ < \F n (p n ) - F n (p)\ + \F(p) - F„(p)\ 0. 
Finally let us show 



if n£° =1 U^ n £„ ^ then £ n -> £ (8) 



Note that, the assertion of the Theorem is trivial in case of n^° =1 U^L ri £' n = 0. 
Otherwise, for ease of notation let B n := Wj*L n Ek, B n \ B := r\^ =1 B n , b £ B. 
Then b £ B n for all n £ N. Hence, there is a sequence fe„ e B„, 6 n — > 6. 
Moreover there is a sequence fc„ such that b n = pk n £ for a suitable fc„ > n. 
Then £„ fc = F Uk (p Uk ) — > F(b) > £ by (7). On the other hand by (5) for arbitrary 
fixed p £ P there is a sequence e n — > such that F(p) > F n {p) — e n > £ n — e n . 
First letting n — >■ oo and then considering the infimum over p £ P yields 

£ > lim sup £ n . 

n— y oo 

In consequence £ n — > £ = ^(fe). In particular we have shown that b £ E ^ % 
thus completing the proof. □ 

Theorem A. 4 (Bhattacharya-Patrangenaru's Strong Consistency). Suppose 
that (ZC) (Ziezold's strong consistency) holds for a continuous function p : 
Q x P — > [0, oo) on the product of a topological space with a space with distance 

{P,d). If additionally ^ U™ =1 E ( n p) (w) enjoys the Heine-Borel property 

for almost all lo £ £1 and the coercivity condition (3) in the second argument 
is satisfied then the property of strong consistency (BPC) in the sense of Bhat- 
tacharya and Patrangenaru is valid. 

Proof. We use the notation of the previous proof. Consider a sequence q n £ E n 
determined by 

d(p, E) = max d{p, E) =: r n . 

PS-En 

If r n y^Owe can find a subsequence njt such that r nk > ro > 0. In consequence, 
every accumulation point of p nk has positive distance to E, a contradiction to 
strong consistency. Hence cither r n — > or there are no accumulation points. 

We shall now rule out the case that there are no accumulation points. In 
view of the Hcinc-Borcl property, this case can only occur for r n — > oo. Under 
condition (3) there is po £ P, a subsequence k(n) and for given n a subsequence 
m, . . . , riMn) of 1, 2, ... ,n such that p(X nj ,po) < C for all j = 1, . . . , fe(n), a.s., 
and 

^-^V{p(X,p )<C}>0. 
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Also, by hypothesis and condition (3), d(po,p n ) — > oo. Moreover by condition 
(3), there is a sequence M n — !> oo with 



1 ^ kin) 
F n (Pn) > -)p(X n Auj), Pn ) 2 > -±-LMl- 

3 = 1 



On the other hand for any fixed p <E E, we have the Strong Law on K, £ n < 
Fn{p) —> F{p) = ^ a.s. , yielding a contradiction. 

□ 

B Proof of Theorem 2.2 

Proof. Consider for fixed p g S% the non-negative 0(2)-invariant smooth func- 
tion on Of (2, k — 1) defined by 



f p (x,v) := f arccos >/(p, a;) 2 + (p, 



2 

2 ' 



Then the condition 



f P (£x,t;v) = min fJe lt x,e lt v) = arccos [ max i{e lt p,x) 2 + (e u p,v)' 

defines two smooth explicit branches ±£ : Of (2, fc— 1)\M° — ► S 1 (cf. Huckemann and Hotz 
(2009, Theorem 4.3)), outside the singularity set 

M° = {(x, v) g Of (2, fc - 1) : L>(.t, w) = = v) 2 - v) 2 } 

using the notation from Huckemann and Hotz (2009): 

A(x,vf = (x,p) 2 + {v,p) 2 , 
B(x,v) 2 = (x,ip) 2 + (v,ip) 2 , 

D(x,v) = 2{{x,p)(x,ip) + (v,p)(v,ip) S j . 

One verifies that M° is bi-invariant, i.e. invariant under the left action of 

S 1 and the right action of 0(2). Hence, on 0(2)\(bf (2, k - 1) \ M^/S 1 , 

the square of the distance p([p], [7]) agrees with the value of the bi-invariant 
function (x, v) i-» fp(£(x, v)x, £(x, v)v), hence 

po 7 h->p 2 ([p],po7) 
is smooth on 0(2)\(of (2, k - 1) \ M°) /S 1 . 



Finally, we show that the geodesies in r(Ef) determined by M° are at least 
7r/4 away from [p]. Obviously, geodesies determined by A(x, v) 2 = = B(x, v) 2 
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have distance 7r/2 to [p\. Suppose now that (x, v) £ M° with A(x,v) 2 > 0. 
W.l.o.g. assume that (x,p) ^ 0. Then 

{x,p) z 

which implies that (v,ip) 2 = (x,p) 2 . In consequence we have also (v,p) 2 = 
(x,ip) 2 and, in particular, sign((x,p)(if, ip)) = — sign((a;, ip)(v,p)) =: e. Then, 
we have for the shape distance p to [p] of shapes along the geodesic 7 through 
[x] with initial velocity v at [x] that 

cos/?([p],7(s)) = max (pcost + ipsini, xcoss + usins) 

0<i<27r 

= max I (x, p) cos(i — es) + (v,p) sin(f — es) ) 
= max(|(x,p)|,|(i),p)|^ 
is constant, giving as desired p{[p], 7) > tt/4. □ 

References 

Bailey, I. W., Sinnott, E. W., 1915. A botanical index of creaceous and tertiary 
climates. Science 41 (1066), 831-834. 

Beem, J. K., Parker, P. E., 1991. The space of geodesies. Geometriae Dedicata 
38 (1), 87-99. 

Bhattacharya, R. N., Patrangenaru, V., 2003. Large sample theory of intrinsic 
and extrinsic sample means on manifolds I. Ann. Statist. 31 (1), 1-29. 

Bhattacharya, R. N., Patrangenaru, V., 2005. Large sample theory of intrinsic 
and extrinsic sample means on manifolds II. Ann. Statist. 33 (3), 1225-1259. 

Bookstein, F. L., 1978. The Measurement of Biological Shape and Shape 
Change, 2nd Edition. Vol. 24 of Lecture Notes in Biomathematics. Springer- 
Verlag, New York. 

Bookstein, F. L., 1986. Size and shape spaces for landmark data in two dimen- 
sions (with discussion). Statistical Science 1 (2), 181-222. 

Bredon, G. E., 1972. Introduction to Compact Transformation Groups. Vol. 46 
of Pure and Applied Mathematics. Academic Press. 

Brenner, W., 1902. Klima und Blatt bei der Gattung Quercus. Flora 90, 114- 
160. 

Choquet, G., 1954. Theory of capacities. Ann. Inst. Fourier 5, 131-295. 



25 



Dryden, I. L., Mardia, K. V., 1998. Statistical Shape Analysis. Wiley, Chich- 
ester. 

Edclman, A., Arias, T. A., Smith, S. T., 1998. The geometry of algorithms with 
orthogonality constraints. SIAM J. Matrix Anal. Appl. 20 (2), 303-353. 

Evans, K., Dryden, I., Le, H., 2006. Shape curves and geodesic mod- 
elling. Manuscript, http: / /www. maths. nottingham.ac.uk/personal / 
ild / papers /curves2 .pdf . 

Frechet, M., 1948. Les elements aleatoires de nature quelconque dans un espace 
distancie. Ann. Inst. H. Poincare 10 (4), 215-310. 

Goodall, C. R., 1991. Procrustes methods in the statistical analysis of shape 
(with discussion). Journal of the Royal Statistical Society, Series B 53, 285- 
339. 

Gower, J. C, 1975. Generalized Procrustes analysis. Psychometrika 40, 33-51. 

Hcndriks, H., Landsman, Z., 1996. Asymptotic behaviour of sample mean loca- 
tion for manifolds. Statistics & Probability Letters 26, 169-178. 

Hcndriks, H., Landsman, Z., 1998. Mean location and sample mean location 
on manifolds: asymptotics, tests, confidence regions. Journal of Multivariate 
Analysis 67, 227-243. 

Hotz, T., Huckemann, S., Gaffrey, D., Munk, A., Sloboda, B., 2010. Shape 
spaces for pre-alingend star-shaped objects in studying the growth of plants. 
Journal of the Royal Statistical Society, Series C 59 (1), 127-143. 

Huckemann, S., 2010a. Dynamic shape analysis and comparison of leaf growth. 
Preprint, arXiv, 1002.0616vl [stat.ME]. 

Huckemann, S., 2010b. Inference on 3D Procrustes means: Tree boles growth, 
rank-deficient diffusion tensors and perturbation models. Scand. J. Statist., 
to appear. 

Huckemann, S., 2010c. On the meaning of mean shape. Preprint, arXiv, 
1002.0795vl [stat.ME]. 

Huckemann, S., Hotz, T., 2009. Principal components geodesies for planar shape 
spaces. Journal of Multivariate Analysis 100, 699-714. 

Huckemann, S., Hotz, T., Munk, A., 2010a. Intrinsic MANOVA for Rieman- 
nian manifolds with an application to Kendall's space of planar shapes. IEEE 
Transactions on Pattern Analysis and Machine Intelligence 32 (4), 593-603. 

Huckemann, S., Hotz, T., Munk, A., 2010b. Intrinsic shape analysis: Geodesic 
principal component analysis for Riemannian manifolds modulo Lie group 
actions (with discussion). Statistica Sinica 20 (1), 1-100. 



26 



Jung, S., Foskey, M., Marron, J. S., 2010a. Comment to intrinsic shape analysis: 
Geodesic principal component analysis for Riemannian manifolds modulo Lie 
group actions. Statistica Sinica 20 (1), 63-65. 

Jung, S., Foskey, M., Marron, J. S., 2010b. Principal arc analysis on direct 
product manifolds. Submitted to the Annals of Applied Statistics. 

Jupp, P. E., Kent, J. T., 1987. Fitting smooth path to spherical data. Appl. 
Statist. 36 (1), 34-46. 

Kendall, D., 1974. Foundations of a theory of random sets. Stochastic Geom., 
Tribute Memory Rollo Davidson, 322-376 (1974). 

Kendall, D. G., 1984. Shape manifolds, Procrustean metrics and complex pro- 
jective spaces. Bull. Lond. Math. Soc. 16 (2), 81-121. 

Kendall, D. G., 1989. A survey of the statistical theory of shape. Statistical 
Science 4 (2), 87-99. 

Kendall, D. G., Bardcn, D., Carnc, T. K., Le, H., 1999. Shape and Shape 
Theory. Wiley, Chichester. 

Kendall, W. S., 1990. Probability, convexity, and harmonic maps with small 
image I: Uniqueness and fine existence. Proc. London Math. Soc. 61, 371- 
406. 

Kent, J. T., Mardia, K. V., 1997. Consistency of Procrustes estimators. Journal 
of the Royal Statistical Society, Series B 59 (1), 281-290. 

Kent, J. T., Mardia, K. V., Morris, R. J., Aykroyd, R. C, 2001. Functional 
models of growth for landmark data. In: Mardia, K. V., Aykroyd, R. G. 
(Eds.), Proceedings in Functional and Spatial Data Analysis. Leeds University 
Press, pp. 109-115. 

Kume, A., Dryden, I., Le, H., 2007. Shape space smoothing splines for planar 
landmark data. Biomctrika 94 (3), 513-528. 

Le, FL, 2001. Locating Frechet means with an application to shape spaces. Adv. 
Appl. Prob. (SGSA) 33 (2), 324-338. 

Le, H., Kume, A., 2000. Detection of shape changes in biological features. Jour- 
nal of Microscopy 200 (2), 140-147. 

Lehmann, E. L., 1997. Testing Statistical Hypotheses (Springer Texts in Statis- 
tics). Springer. 

Matheron, G., 1975. Random sets and integral geometry. Wiley Series in Prob- 
ability and Mathematical Statistics. New York. 

Morris, R., Kent, J. T., Mardia, K. V., Aykroyd, R. G., 2000. A parallel growth 
model for shape. In: Arridge, S., Todd-Pokropek, A. (Eds.), Proceedings in 
Medical Imaging Understanding and Analysis. Bristol: BMVA, pp. 171-174. 



27 



Royer, D. L., Meyerson, L. A., Robertson, K. M., Adams, J. M., 10 2009. Phe- 
notypic plasticity of leaf shape along a temperature gradient in acer rubrum. 
PLoS ONE 4 (10), e7653. 

Terras, A., 1988. Harmonic Analysis on Symmetric Spaces and Applications II. 
Springer- Verlag, New York. 

Thompson, D. W., 1917. On Growth and Form. Cambridge University Press. 

Wolfe, J., 1978. A paleobotanical interpretation of tertiary climates in the north- 
ern hemisphere. American Scientist 66, 694-703. 

Wolfe, J., 1993. A method of obtaining climatic parameters from leaf assem- 
blages. US Geological Survey Bull. 2040. 

Ziezold, H., 1977. Expected figures and a strong law of large numbers for random 
elements in quasi-metric spaces. Trans. 7th Prague Conf. Inf. Theory, Stat. 
Dec. Func, Random Processes A, 591-602. 

Ziezold, H., 1994. Mean figures and mean shapes applied to biological figure and 
shape distributions in the plane. Biom. J. (36), 491-510. 



28 



